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Abstract 

Motivated by the practical challenge in monitoring the performance of a large num- 
ber of algorithmic trading orders, this paper provides a methodology that leads to 
automatic discovery of causes that lie behind poor trading performance. It also gives 
theoretical foundations to a generic framework for real-time trading analysis. The com- 
mon acronym for investigating the causes of bad and good performance of trading is 
TCA (Transaction Cost Analysis [Rosenthal, 20 09]). Automated algorithms take care 
of most of the traded flows on electronic markets (more than 70% in the US, 45% in 
Europe and 35% in Japan in 2012). Academic literature provides different ways to 
formalize these algorithms and show how optimal they can be from a mean-variance 
[Almgren and Chriss, 20 00] , a stochastic control [Gueant et al., 20"T2] , an impulse con- 
trol [Bouchard et al., 2011] or a statistical learning | Laruelle et al., 2011[ viewpoint. 
This paper is agnostic about the way the algorithm has been built and provides a 
theoretical formalism to identify in real-time the market conditions that influenced 
its efficiency or inefficiency. For a given set of characteristics describing the mar- 
ket context, selected by a practitioner, we first show how a set of additional derived 
explanatory factors, called anomaly detectors, can be created for each market order 
(following for instance [Basseville and Nikiforov, 1993[ ). We then will present an on- 
line methodology to quantify how this extended set of factors, at any given time, 
predicts (i.e. have influence, in the sense of predictive power or information defined in 
[Cristianini and Shawe-Taylor, 2000[ , [Shannon, 1948] and [Alkoot and Kittler, 1999] ) 
which of the orders are underperforming while calculating the predictive power of this 
explanatory factor set. Armed with this information, which we call influence analysis, 
we intend to empower the order monitoring user to take appropriate action on any 
affected orders by re-calibrating the trading algorithms working the order through new 
parameters, pausing their execution or taking over more direct trading control. Also 
we intend that use of this method in the post trade analysis of algorithms can be taken 
advantage of to automatically adjust their trading action. 
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1 Introduction 

Institutional investors use optimal dynamic execution strategies to trade large quantities of 
stock over the course of the day. Most of these strategies have been modelled quantita- 
tively to guarantee their optimality from a given viewpoint: it can be from an expectation 
[Bertsimas and Lo, 1998| , a mean- variance Almgren and Chriss, 2000] , a synchronized port- 



folio Lehalle, 2009 or a stochastic control [Bouchard et al., 2011 perspective 



Algorithmic trading analyzes high frequency market data, viewed as actionable infor- 
mation, in order to automatically generate and place automated trading orders. Hence, 
automatic trading algorithms can be viewed as parametrized black boxes. They are ini- 
tialized and monitored by human traders, who have the capability to set and adjust some 
high-level parameters that drive the algorithm - incorporating the a trader's view and al- 
lowing reaction to unexpected events. However standard automatic trading systems, do 
not currently offer advanced computerized discovery of potential causes for poor trading 
performance. Our paper focuses on automated online monitoring of portfolio performance 
by real-time scanning of static (like the sector, the country, etc, of the traded stock) and 
dynamic (like the bid-ask spread, the volatility, the momentum, the fragmentation of the 
traded stock) "explanatory" market variables quantifying their current influence on port- 
folio performance. The Influence Analysis methodology we have developed and tested can 
provide real-time feedback to traders by detecting the most significant explanatory factors 
influencing current degradations of trading performance on specific portfolios. 

TCA practitioners, in particular for real-time analysis, are faced with the problem of 
automating the understanding of how market context affects trading performance of a large 
number of orders. This paper is the first theoretical formalization of such a process proposing 
a framework to understand and improve TCA (off line or in realtime) in several aspects: (1) 



augmenting the description of the market context (using scoring -section 2.3- and pattern 
detection -section [3]-) to identify relationships between this enhanced description and the 
performance of a large basket of orders. A key feature of our influence analysis method- 
ology is that it does not require actual knowledge of the trading algorithms mechanisms. 
Online influence analysis could hence be useful in updating trade executions as well as in 
re-calibrating trading algorithms. Influence analysis on historical data could also improve 
existing trade scheduling algorithms, selecting new significant signals in their kinematics. 

Our online influence analysis generates in real-time a very short list of market fac- 
tors "explaining" the current lack of performance in a collection of intra-day trades. The 
automatic trading algorithms can be arbitrary; it can be a result of a classical mean- 
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variance optimization (see Almgren and Chriss, 2000] ), stochastic control of a brokerage 



benchmark (see Bouchard et al., 201 1| ), stochastic control of a market making scheme (see 



Gueant et al., 2012] ), a liquidity-seeking algorithm driven by a stochastic algorithm opti- 



mization (see Laruelle et al., 2011| ), or even purely heuristic driven ones. The analysis 



conducted here is supported by automatic detection of conjunction of poor trading quality 
and singularity or rarity of the market context. The influence analysis can be launched as 
soon as a portfolio performance evaluation criterion is selected, such as the slippage with 
respect to the arrival price, to the VWAP (volume- weighted average price), etc. Our online 
influence analysis relies on extensions of classical relative entropy techniques (see for instance 
BriUinger, 2004| [Mougeot and Azencott, 20TT] [Azencott et al., | [Bilhngsley, 1965| ) to gen- 



erate in real-time optimized empirical relations between an automatically selected small set 
of high influence explanatory factors and any pre-assigned trading performance criterion. 
Our approach also quantifies at each time t the current influence of a given pool of market 
factors on trading performance degradation. 

We first describe typical sets of market variables and trading performance criteria to 
which our influence analysis applies, and we outline our benchmark sets of intra-day data, 
used to test our approach. In section [2] we describe the market dataset considered for our 
study. We then present in section |3] the three online anomaly detectors we have developed 
to enrich in real-time any set of dynamic input market variables. Such detectors are bespoke 
implementation of a more generic class of detectors that could be used like wavelet coefficients 
IMallat, 2008] . Section |l]outlines our generic framework for influence analysis. We select and 
fix any pragmatic binary trading performance criterion Yt detecting low trading performance 
at time t. This binary criterion will be deduced from a variable of performance PEj of the 
trading portfolio. Given any small set G of explanatory variables X"^ = {Xj , j G G} (deduced 
from a multiscale analysis of market context descriptors {M/, 1 < j < J}), we generate the 
current "best" predictor Yt = ht{{X'-^}) of Yt based on these explanatory variables, and we 
compute its current predictive power, which we call the infiuence coefficient Jt{G) of the 
group G on Yt The time dependent set G™^^ which maximizes Jt{G) among all small groups 
G of explanatory factors is then determined, and if Jt{Gf^'^^) is high enough, the set G™"^ 
can be exported in real-time to traders, as the set of market variables which best explains 
current trading performance degradations. In section 6^ we present the accuracy analysis 
of the influence computation, and obtain pragmatic conditions for robust identiflcation of 
explanatory variables. We then present the steps to compute optimized parametric predictors 
based on single explanatory factors, and to generate hierarchical combinations of optimal 
predictors. Section [9] presents the test results of our influence analysis on benchmark intra- 
day datasets provided by Credit Agricole Cheuvreux Quantitative Research Group. 
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2 Dynamic dataset to be monitored online 



2.1 The automated trading process 

Only few assumptions are demanded to a trading process to be monitored by the methodology 
proposed here. By ^Hrading process''^ we mean the operation of buying or selling shares or 
any other financial instrument in more than one transaction. Since the "last leaf" of any 
investment or hedging strategy is to obtain transactions, a trading process is needed if 
a block execution cannot be obtained. With the conjunction of the financial crisis and 
regulation changes (mainly Reg NMS in the US in 2005 and MiFID in Europe in 2007, 
see |Burgot et al., 20131 for more details), the capability to close a deal in one transaction 
strongly decreased. Hence most market participants are ''slicing" their large orders (see 
[Almgren and Chriss, 2000") , [Bouchard et al., 2011| or Pages et al., 2012] for quantitative 



approaches of optimal slicing). At a St minutes time scale (say St = 5), an important 
variable of a trading process is its ''participation rate" Pm-st] the trader (manually or tuning 
parameters of some trading robots), succeeds in obtaining Pm-stVmSt shares during the mth 
interval of St minutes when the whole market traded VmSt shares. 

The adequate trading rate is difficult to achieve. First because its optimal value is a func- 
tion of market conditions during the whole trading process (i.e. it is not causal). Secondly, 
market conditions are either price driven (like the volatility, the presence of price jumps or 
trends, correlations between some assets, etc) or hquidity driven (the bid-ask spread, the 
market depth, etc); that all of these are difficult to anticipate or predict. Moreover, even 
if the theoretically optimal trading rate would have been known in advance, the actions of 



the trader himself have an impact on future market conditions Moro et al., 2009 . A trader 
is thus continuously monitoring his trading rate in conjunction with market conditions to 
check: first that ex-post his trading rate has been close to the expected one, second that a 
change in his trading rate does not come from an unexpected change of market conditions, 
and last that he is not impacting the price formation process. The faster he understands 
what is happening to the trading process, the more efficiently he will be able to react. 

The motivation to buy or sell does not change the way to apply the "influence analysis" 
methodology presented in this paper. It can be driven by long term considerations (typical 
for orders sent by institutional investors to executing brokers' trading desks), to hedge a 
derivative portfolio, to implement an arbitrage strategy, or even inside a market making 
scheme (see [Gueant et al., 20T2] ). For all of these motivations the same trading process 
takes place: a human being monitors a trading rate according to performance criteria (that 
are specific) and tries to adjust his trading rate as fast as possible to take into account 
changes in market conditions. The methodology proposed in this paper provides decision 
support to the trader. 



Performance criteria of a trading process. In our framework, the proxy for the target 
of the trader is called his performance evaluation criterion. Let us define a few possible 
criteria: 
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• for a market-maker: a decreasing function of his inventory imbalance, and his profits 
are good performance criteria; 

• for a brokerage trading desk: a decreasing function of the distance to a fixed participa- 
tion rate (for instance 10% or 20% of the market traded volume), the obtained average 
price compared to the Volume Weighted Average Price (VWAP) of the market, or to 
the arrival price, or to the close price, are meaningful criteria; 

• for an arbitrageur: a decreasing function of his tracking error, and his profits should 
be chosen. 



Description of the market context. In addition to the performance evaluation criteria, 
we will need market descriptors to quantify the market context. They will be used by the 
proposed methodology to build an online understanding of the causes of bad performance. 
Typical market descriptors are: 

• Price driven market descriptors: 

— prices returns or price momentum (signed to be in the same direction as the side 
of the monitored order) are important since it is more difficult to buy when the 
price is going up rather than when it is going down. 

— The volatility is a common proxy for the amount of uncertainty in the price 
dynamics. Its influence on trading performances is not straightforward since some 
volatility can help to capture passively some flows, but a too high volatility level 
can lead to adverse selection. Moreover, market impact models link the volatility 
to the impact of a trade a negative way (see Almgren et al., 2005| ), meaning that 



a too high level of volatility is negative for almost all trading processes. 

Liquidity driven market descriptors: 

— the bid-ask spread is the distance (in basis points of the current price) between 
the best ask price and the best bid one. It thus describes the state of the "auc- 



tion game" between market participants (see Lehalle, 2012 for more details and 
terminology) . 

— Market traded volume is an important characteristic of the market since it is 
easier to buy or sell when the market is active than when nothing is traded. 

— The visible size at first limits (also called average volume on the books) can also 
be used to quantify the current market depth. 

Any other characteristic of the trading instruments can be added in the analysis. For 
shares, modal variables like the sector of the stock or the market capitalization of the listed 
firm are of importance. 
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Anomaly detectors. To be able to capture the causes of bad trading performance, it is 
often useful to have access to information other than the averaged values of market variables. 
For instance price jumps, price trend changes, volume peaks and crenels are not captured by 
averages and we would hke to take them into account in our influence analysis methodology. 
Section |3] shows how to build such detectors that will be used in the analysis. 



2.2 Trading orders 

We consider a portfolio of at most K trading orders T(A;), k = 1, . . . , K driven by automatic 
trading algorithms, and supervised by one or more traders. Each trading order is defined 
by a few "static" variables such as buy/sell label, order size, trading place, section, country, 
capitalisation, free-float, benchmark type (VWAP, arrival price, etc), etc. In our intra- 
day benchmark studies a portfolio typically involves 200 < Kt < 700 active orders at any 
arbitrary 5 minute time slice. 



2.3 Market descriptors 

Each trading order T(A;) focuses on a speciflc asset whose dynamics is recorded at each time 
point t through a flxed number of basic "market descriptors" (k) , {k) , (k) , . . .; in 
our benchmark study below, we have focused on a subset of the following market descriptors: 

. = Volatility 

• = Spread 

• = Momentum in bid-ask spread 

• = Momentum in bp 

This list can be augmented by the rarity scores Score (M*) of the market variables M*. These 
scores are deflned by Score(M*) = F'^{M^) where is the cunulative distribution function 
of Mi. Each such score necessarily has a uniform distribution Borovkov, 1998| . If poor 



performance on a given set of stocks is due to a strong increase in the volatility level, the 
concrete cause may either be due to volatility reaching an " absolute" psychological threshold, 
or to volatility being high relatively to its usual levels. In this last case the volatility score 
will be a better explanatory factor for poor performance. We will use here the following 
scores, increasing our number of market variables: 

• = Volume Rarity Score 

• = Volatility Rarity Score 

• = Spread Rarity Score 
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Figure 1: Heatmap of online performances of a traded basket of 7 orders (top) in conjunction 
with values of one explanatory variable (the volatility; bottom), the correspondance between 
the two heatmaps is not obvious. The abcissa represents the evolution over five-minute time 
shces. 



Figure [3] displays typical intraday (January 14, 2011) plots of the time series correspond- 
ing to the 7 basic market variables listed above for an anonymous stock. 

We also display heat map representations of rarity scores for multiple stocks in Figures [T] 
and|4| where each row displays the time series of rarity scores for one single stock (associated 
to one trading order in our benchmark data), and each column represents one time slice. 
Clearly "extreme" rarity scores tend to appear in clusters, and to co-occur across multiple 
stocks. 

As will be seen below, high co-occurrence frequency of "dynamic anomalies" such as 
peaks, jumps, etc within a group of stocks tend to "explain" simultaneous lack of performance 
for the corresponding trading orders. 
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Figure 2: Heatmap of online performances of the some basket of 7 orders (top) in conjunction 
with the scores of the same explanatory variable (the volatility; bottom), compared to Figure 
[T| one can see that the scores change the high and low values of the explanatory variables, 
giving birth to more potential conjunctions with bad trading performances. 



2.4 Trading performance evaluation 

We also select a "trading performance evaluator" PE providing at each time point t, and for 
each active trading order T{k), a quantitative evaluation PEf(A;) for the current performance 
of T(A;). 

In our study we have selected by default PE = "slippage in bid-ask spread" {slippage 
being the average price of the order minus the benchmark -VWAP, arrival price, close price, 
etc- for a sell order, and the opposite for a buy order), but there are no restrictions on the 
user choice for this PE variable. In particular, other examples of PE include "slippage in 
bp" (basis points), "Slippage in Dollars", "Absolute value of slippage in bid-ask spread", 
etc. We are assuming that degraded performances are associated to low values of PE. For 
each trading order T{k) the performance evaluator and the 7 market descriptors are volume 
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averaged over successive time slices of arbitrary duration (set at 5 minutes for our benchmark 
study). Thus we generate 8 time series M}{k)^ . . . , Mj{k) and PEt(/c) indexed by time shces 
t. These time series generically have missing values since orders do not necessarily begin or 
end at the same time. Fix a low percentile threshold q such as g = 5% or g = 3% to binarise 
the performance evaluator. 

At time slice t, call Kf < K the number of currently active trading orders T{k). The 
q%- quantile of the corresponding Kf performance evaluations PEt{k) is denoted by if We 
consider it as a PE-threshold, separating "bad trading performances" (tagged "1") from 
"normal trading performances" (tagged "0"). We then binarize the performance evaluations 
PEt{k) by setting 

Yt{k) = l if PEt{k)<it, 
^ ' Yt{k) = if PEt{k) > it. 

In Figure |3} we plot an example of the intraday behaviour of a trading algorithm. Its trading 
performance evaluation PEj can observed within a trading day, in real-time, like some of the 
market context variable we used. A trading order may or may not be active at a given time 
slice as observed in Figure [TJ 

Figure |4] displays synchronous intraday plots of trading performance evaluations in con- 
junction with the values of a few selected market variables. An essential goal of our method- 
ology is, for each fixed time slice, to quantify on line the current influence of a market 
variable on trading performance degradation. Our automated online influence quantification 
replaces expert visual inspection of current trading orders performances, to identify critical 
market variables explaining trading performance degradations. For instance, visual inspec- 
tion of Figures [T] and |4] will naturally "explain" the low performances observed at time slices 
t = 39,40,41 by the obvious trend changes simultaneously observed on rarity scores as well 
as by the volatility peak. 



3 Online anomaly detectors 
3.1 Anomaly detection 

Online anomaly detection is a critical step in many applications, such as safety of complex 
systems, safety monitoring in automotive or aeronautics industries, remote health monitoring 
in biomedicine, real-time quality control for industrial production lines, etc (See Aviv, 1991 



Basseville and Nikiforov, 19931 [Basseville, 19881 [Gustafsson, 2000| [Lehalle and Azencott, 2004] ) 



In the context of trading performance online monitoring, it is also quite natural to system- 
atically enrich raw market descriptors by automated detection of anomalies affecting their 
dynamics. We have thus developed algorithmics dedicated to the online implementation of 
this processing step. 

The occurrence of visually evident anomalies can be detected by algorithmic tracking 
of local regime changes in market descriptors dynamics, and may have potentially strong 
influence on performance degradation for the corresponding trading orders. We have hence 
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Figure 3: Example of intraday behaviour of performance variables for a given order. Top: 
the prices (the variations of the average obtained price is in gray, the market VWAP in dark); 
bottom: the cumulated traded volume (grey) and market volume (dark). 

developed and implemented a set of 3 parametrized anomaly detectors, dedicated to the on- 
line identification of "significant" Peaks or Crenels, Jumps, and Trend Changes on generic 
time series. These 3 detectors automatically locate emerging anomalies, quantify their in- 
tensities, and filter them through adjustable gravity thresholds. 

3.2 Building online detectors 

Consider a generic discrete time series Ut. A smoothed "baseline" BUt is generated as 
a moving local median of Ut- One then computes the local standard deviation at of the 
"noise" Ut — BUt, and in turn, this defines "outlier" values of Ut. Our three online anomaly 
detectors are based on local trend extractions at each time slice t by fitting linear or quadratic 
regression models on short moving time windows to the left and the right of t. The detector 
parameters have simple geometric interpretations for the users and are kept fixed during 
online influence analysis. Each anomaly detector is dedicated to a fixed type of anomaly, 
and generates a binary time series encoding the presence or absence of this anomaly type at 
successive time slices of Ut- 
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Figure 4: Conjunction of the performance (top curve) of one traded order (first line of Figure 
|4j) with the market context (bottom heatmap). 



Peaks/ crenels detector 

A "peak" is the sudden occurrence of a high "outlier" value of Ut- More generally, a "crenel" 
is a cluster of successive high "outliers" with approximately equal values. Each crenel is 
described by 3 geometric "crenel features", namely, its time duration, its thickness (i.e., ab- 
solute difference between highest and lowest crenel points), and its height above the baseline 
BUf. Minimal threshold values are imposed on these 3 features, as well as a minimal time 
gap between successive crenels. 

To detect peaks and/or crenels on the series Ut, one first extracts outliers with respect 
to the baseline BUf, then one applies simple filters to detect local geometric configurations 
of outliers which satisfy the threshold constraints imposed on the three "crenel features" 
described above. If a peak or crenel is detected at time t, then "peak/crenel intensity" 
Peakt is set equal to "height" of the peak/crenel above the baseline. If no peak or crenel is 
detected at time t, one sets Peakt = 0. 



12 




Figure 5: The three abnormal patterns targeted by our three anomaly detectors: (1) price 
trends, (2) price jump, (3) volume peak, (4) volume crenel. 

Jumps detector 

A "jump" at time t is a sudden level change between the Ug values on finite time windows to 
the left and to the right of t. Bona fide jumps are described by 2 features, namely, a duration 
2L and a minimal jump size A. For each t, one fits two distinct quadratic regressions to the 
baseline BUg, namely Reg~ for (t — 1 — L) < s < (t — 1) and Reg~^ for t < s < {t + L), 
where L is a fixed parameter. 

A jump is detected at t if the "jump size" JS{t) = \Reg^it) — Reg~{t)\ is larger than 
A, provided the two regressions have small enough residuals. If a jump is detected at time 
t on the series Ut, then "jump intensity" Jumpt is set equal to the "jump size" JS(t). If no 
trend change is detected at time t, one sets Jumpt = 0. 
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Trend changes detector 

Bona fide "trend changes" are described by 3 features, a duration 2L, a minimal slope change 
X, and a continuity modulus e. For each t, one fits as above two quadratic regressions to 
the basehne BUs, namely Reg~ to the left of t and Reg~^ to the right of t. Call a^, a" the 
slopes of Reg~^, Reg~, and define the "trend change size" 

TCS{t) ^\a+-a-\. 

A local "trend change" is detected at time t if Reg~^ , Reg~ have sufficiently small residuals 
and verify 

TCS(t) > A ; and \Reg+(t) - Reg'{t)\ < e. 

If a Trend Change is detected at time t on the series Ut, then "trend change intensity" Trendt 
is set equal to the "trend change size" TCS{t). If no jump is detected at time t, one sets 
Trendt = 0. 

4 Probabilistic framework for influence analysis 
4.1 The enriched set of explanatory factors 

For each trading order T(A;) and each market descriptor , onhne analysis of the time series 
t Ut = Ml{k) by the 3 anomaly detectors progressively generates 3 time series of "anomaly 
intensities" Peakl{k), Jumpl{k),Trendl{k) which respectively encode the "anomaly intensi- 
ties" of the Peaks/Crenels, Jumps, Trend Changes detected on the time series Ut- Applying 
the 3 anomaly detectors to our 7 basic market descriptors yields a set of 3 x 7 = 21 time 
series A], . . . , Af- of anomaly intensities. Let At — Al be any one of these 21 anomaly inten- 
sities. A high value of detected at time s may have a degradation influence on trading 
performances observed not only at time slice s but also on performances observed at later 
time slices t e [s, s H- r], where the short time lag r is fixed (as a user selected parameter). 
When we analyze below the influence at time t of detected anomalies, we will hence take 
account of all the recent anomaly intensities Ag where t — t < s < t. To this end, we will 
replace each one our 21 anomaly intensities At by a smoothed anomaly intensity at scale r 
[[74(|t]] defined by the following formula, 

[[At\T]] = max As. 

t-T<S<t 

From now on we will assume that a time scale r is fixed, and note [[At]] instead of [[Af|r]]. 
Note that [[At]] > records the maximal gravity of recent anomalies (of fixed type) affecting 
the dynamics of a fixed market variable, and that for "most" time slices t one has [[At]] = 
for most scales r. We thus generate online N — 2S time series X^,. . . of explana- 
tory factors, namely the 7 current market descriptors Ml, . . . , Mj and their associated 21 
smoothed anomaly intensities [[Aj]], . . . , [[^j^]] at chosen time scales. 
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At each time slice t, and for each trading order T{k), denote by ^t{k) G the vector 

Mk)=[Xl{k),...,X^{k)], 

which regroups the current values of our N = 28 explanatory factors. 

In the online context, the K x N time series of explanatory factors become progressively 
available, and provide at each time slice t the Kt x N incoming new values Xl{k), where 
1 < k < Kf < K, 1 < j < N and Kt is the number of trading algorithms handling active 
orders at time t. These factors are viewed as potential "causes" for eventual degradations 
affecting the current binary performance evaluation Yt{k) of trading order T{k) where Yt 
binarizes the performance evaluations PE. The goal of our online influence analysis is to 
identify at each time t the explanatory factors which have the most significant degradation 
influence on current trading performances of the whole portfolio. 

4.2 Probabilistic framework 

To analyze functional relationships between trading performance evaluations and explana- 
tory factors, we introduce a time dependent probabilistic framework. 

Fix any time slice t. We consider that each one of the Kt trading order T{k) currently 
active at time t has been extracted at random from a very large finite pool fl of "virtually 
observable" trading orders T{u) , with u = 1,2, . . . , K, where the fixed cardinal ^ of f2 is 
much larger than Kt. 

We consider as a probability space endowed with the uniform probability. The binary 
evaluations Yt{u) of current trading performances defined in ([T]) can then be viewed as a 
single binary valued random variable Yt defined on Q, verifying for each u & Q 

Yt{co) = l if FEtiu)<it, 
^ ^ Yt{u) = if PEtiu) > if 

Similarly all our explanatory factors X}, j = 1,2, . . . , N, can be viewed as real valued 
random variables Xf {u) defined on Q and for which we have only observed the Kt values 
currently available at time t, namely the values Xl{k) G M for i = 1,2, Kt. Then 
= [Xt, . . . ,X^] becomes a random vector defined on Q, with values in M^. Our online 
influence analysis is performed anew at each fixed time slice t and involves only the currently 
available Kt joint observations of the random vector Xt and of the random variable Yt. For 
each fixed t, we will often omit the subscript t and adopt the abbreviated notations 

Y = Yt; X^=Xl- X = Xt = 

5 Binary valued predictors and their predictive power 

The time slice t is kept fixed and deliberately omitted in this whole section, where we 
introduce the precisely relevant notions of predictors, predictive power, and quantitative 
influence of explanatory factors. 
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5.1 Binary predictors 



On a probability space fl consider an arbitrary random vector X = X^, . . . of "ex- 
planatory factors" and an arbitrary binary valued random variable Y. We call binary valued 
predictor of the binary random variable Y any random variable Y which is a deterministic 
function of X. Clearly Y is then necessarily of the form Yb = 1b{X) where 1^ is the indi- 
cator function of a fixed but arbitrary Borel subset of — )■ {0; 1}. 

This class of binary predictors is naturally imbedded in the convex set of "randomized" bi- 
nary predictors 1^ of y , indexed by arbitrary "decision functions" G where $ is the set 
of all Borel functions (p{x) defined for x e and such that < < 1. The predictor 
Y^ defined by each such verifies 

P{Y^ = 1\X) = <j>{X) ; P{Y^ = 0\X) = 1 - <I>{X) 

Note that $ is a closed compact convex subset of L^{R^), endowed with its weak topology 
as dual of Li(i?^). 



5.2 Probabilities of correct predictions 

For randomized binary predictors 1^ of the true but yet unknown Y, the accuracy of Y 
is usually characterized by the two conditional probabilities of correct prediction p^ and 
or equivalently by the two absolute probabilities of correct prediction — P^{(f)) and 
po = P0((/)), defined by 

pi = P(f = l|r = l) and =p(y = l;y = 1) 
pO = P(y = 0|r = 0) and p° = p(y = 0;y = 0) 

(3) 

The obvious expressions 

Pi = E(0(X)ly=i) ; and P° = E((l - 0(X))ly=o) 

show that P^(0) and P°(0) are weakly continuous functionals of G Intuitively the 
predictive power of a predictor should be an increasing functional of and p^, or equivalently 
an increasing functional of P^ and P^. Indeed the classical 2x2 confusion matrix for the 
estimation of Y by the binary predictor Y is determined by as follows: 



y = Y = l 





1 - pi 


1 - IP 





This motivates the following definition of predictive power. 
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Figure 6: Confusion Table for the estimation of F by a binary predictor Y 



5.3 Predictive power 

For any random vector of "explanatory variables" X G and any binary random variable 
Y jointly defined on a probability space Q, the joint probability distribution /i of {X,Y) 
belongs to the compact convex set of all probabilities on R'^ x {0; 1}. Select and fix any 
non-negative continuous function Q{^,a,b) of fi & A4 , < a < 1,0 < b < 1 which is a 
separately increasing function of a and b. 

We then define the predictive power 7r(0) = 7r(yU, (p) of each randomized binary predictors 
n of y by 

(4) 7r(0) = Qif^,P\P') = g(^,/xV,/iy) 

where /i^ = F{Y = 1) and mu^ = P(F = 0). Note that in our benchmark study below, 
due to our adaptive binarization of trading performance, the probabilities /i^ and fi^ will be 
constant in time and will have known pre-assigned fixed values such as 3% and 97%. 
The predictive power 7r(/i, 0) is then clearly continuous in (/x, (p) for the weak convergence 
topologies of Ai and 

We shall see in the next paragraph that this definition of predictive power is compatible, 
and actually extends these definitions actually extend the predictive power quantification by 
relative entropy. 

The functional P^, P") will be called a predictor quality function. Here are basic exam- 
ples of functions Q often used in the accuracy analysis of predictors. 

• Qrmn=mm{P\P'^}, 

• Qweight = uP^ + {1 — u)P^, for somc < u{jj,) < 1, 



5.4 Predictive power based on mutual information and/or relative 
entropy 

An information theoretic characterization for the predictive power of a predictor Z = Y oi 
Y is the amount of information which Z reveals on the yet unknown variable Y . This has 
classically been quantified by relative entropy criteria such as the mutual information ratio 
MIR(Z, Y) (see for instance [Brillinger, 2004 Azencott et al., Mougeot and Azencott, 2011 



[Billingsley, 1965 Khinchin, 1957 Shannon, 1948| ). Recall that the entropy H{U) of a ran- 
dom variable U taking only a finite number of values Ui is given by 



H{U) = -^P(f/ = Ui)\ogF{U 



Ui 



The mutual information ratio MIR(Z, Y) between Y and its predictor Z 

H{Z) + H{Y)-H{Z,Y) 



Y is defined by 



MIR(Z, Y) 



H{Y) 
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where H{Z), H{Y), H{Z, Y) are the respective entropies of the three random variables Z, Y 
and {Z,Y). Here Z and Y are binary valued and {Z,Y) takes only 4 values. 

The ratio M1R(Z, Y) which is directly related to the relative entropy of Z with respect to 
Y lies between and 1, and reaches the value 1 if and only if Y is a deterministic function of 
Z. Good predictors Z oiY should thus achieve high values of MIR[Z,Y). Indeed we have 
the following result . 

Proposition 1. Fix any random vector of "explanatory variables" X G and any binary 
random variable Y jointly defined on a probability space and call n the joint probability 

distribution of {X,Y). Assume that Y is not deterministic, so that H{Y) > 0. Then for 
all randomized binary predictor Z = Y^ of Y defined by arbitrary Borel decision functions 
< < 1, the mutual information ratio MIR{Z,Y) is a separately increasing functional of 
the two conditional probabilities of correct decisions p^ = P(F = 1\Y = 1) and p^ = F{Y = 
0\Y = 0), provided p^ > 1/2 and p° > 1/2. 

Proof: The joint distribution of {Z,Y) on {0; 1}^ is easily seen to be determined by the 
3 parameters p^,p^, and /x^ = F{Y = 1). Since H(Y) is fixed, MIR{Z,Y) is an increasing 
linear function of H{Z) — H{Z,Y). An elementary computation easily proves the identity 



where Ent{p) — — plog(p) — (1 — p) log(l — p) and /jP — 1 — jj}. Since the entropy Ent{p) 
decreases with p for p > 1/2 , this concludes the proof. 

5.5 Generic optimal randomized predictors 

We now characterize the randomized predictors achieving optimal predictive power.. 

Proposition 2. Fix a random vector X G i?^ of explanatory factors and a target binary 
variableY . LetO < v{x) < 1 be any Borel function ofx G such thatv{X) — ¥{Y — 1 | X) 
almost surely. 

For any Borel decision function (/> G $, define the predictive power of the randomized 
predictor Y^ by 7i{(f)) = Q{fi, P^{(f)), P'^ {</))) , where Q is a fixed continuous and increasing 
function of the probabilities of correct decisions P^,P^ . Then there exists G $ such that 
the predictor Y^ has maximum predictive power 



Any such optimal Borel function < < 1 must almost surely verify, for some suitably 
selected constant < c < 1. 



H{Z)-H{ZX) 



li^Entip^) 



li^Entip^) 



Tr{ip) — max7r(0) 



(5) 



ip{X) = 1 for v{X) > c ; ip{X) = for v{X) < c 
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Proof: Predicting the actual value of the yet "unknown" binary random variable Y given 
the random vector X of explanatory variables is clearly equivalent to deciding between the 
two formal "hypotheses" : 

i/o : {r = 0} versus : {Yt = 1} 

on the basis of the observed X, which is a standard testing problem [Lehmann and Romano, 2005 . 
Any borelian "rejection region" B G defines the binary valued predictor 1b{YX of Y 
which rejects if ° whenever X G i?. More generally any Borel decision function < (f){x) < 1 
defines the binary predictor Y^ which, given the observed X rejects with probability 
4>{X). This test has "confidence level" a = 1 — P", and "detection power" where P^,P^ 
are the probabilities of correct decisions for the predictor Y^. 

Classical testing of versus involves the likelihood function defined with probability 
1 by 

^ ^ P(F = 0|X) l-v{X) 

By Neymann-Pearson theorem ( [Lehmann and Romano, 2005| , for each "confidence level" 
< (1 — a) < 1 there exists a randomized binary predictor Y^ which maximizes P^{(t)) 
among all predictors verifying P'^{4>) > 1 — a. Morover one can find c > such that the 
Borel function < ^^(a;) < 1 verifies almost surely 

(6) ipiX) = 1 for L{X) > c ; ^{X) = for L{X) < c 

Since both P^ = P^{(f)) and P^ = P^{(p) are weakly continuous functions of (j) G $, the 
predictive power 7r(0) = Q{fi, P^^cj)), P^{(f))) of is also weakly continuous in and thus 
must reach its maximum on the weakly compact set $ for some Borel function 0* G 

Since P^,P°) is an increasing function of P^ and P", we see that the optimal pre- 
dictor Y^* must necessarily maximize P^{(f)) among all predictors verifying P^{(f)) > P°(0*). 
Select the confidence level 1 — a = P°(0*) and apply the Neyman-Pearson theorem just re- 
called above to conclude that there exists a threshold c > and an associated i/j of the form 
(|6|such that P'^{ip) > P^{(j)*) and P^iip) > P^(0*). This implies the following inequality 
between predictive powers 

irW = g(/i, P\ij), P°(^)) > g(^, Pi(0*), P°(0*)) = 7r(0*) 

and hence nlip) = 7r(0*) sice Y^* has maximal predictive power. This clearly achieves the 
proof. 

Definition 1. In the preceding situation we will quantify the capacity of the random vector 
X to "explain" the target binary variable Y by an "influence coefficient" I{X,Y) defined as 
the predictive power 7c{tlj) of an optimal randomized binary predictor Y^ ofY. More precisely 
the influence coefficient o/X on Y is given by 

X(X,F) = TT{ip) = max7r(0) 
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Clearly, once the quality function Q is selected and fixed, the influence X(X, Y) depends 
only on the joint probability distribution yU of (X, Y). 

The notion of influence coefficient is immediately extended to arbitrary subsets of ex- 
planatory factors. To any subset of indices G C {1, 2, . . . , iV}, we associate the random 
vector = {X^ \ j G G} of explanatory factors, with < N, and we define as above 

the influence coefficient by 

(7) J{G) = X(X^, Y) < I{X, Y) 

6 Quantifying on line the influence of groups of ex- 
planatory factors 

6.1 Benchmark study context 

In our intraday data study below, at each fixed time slice t, we observe simultaneously 
on Kt < 700 trading lines the current values of our random vector X G of X = 28 
explanatory factors (see section [S]), and the corresponding current values of the binarized 
trading performances Y. Ideally, for each subgroup X'^ = {X^ \ j G G} of explanatory 
factors, where G G {1,2,..., N}, we want to estimate the current influence coefflcient J{G) 
of X'^ on Y by the formula Q, using only the current sample of Kt jointly observed values 
of {X^,Y). Statistical reliability of the J{G) estimates will lead us below to consider only 
groups G of small cardinal. 

Our goal was to determine, at each time slice t, one or possibly several groups G of 
explanatory factors having small cardinal and high influence J{G) on current trading per- 
formances, and to specifically focus on detecting small groups of current "major causes" for 
the trading performance degradations just observed at time t. 

To this end we selected a class of asymmetric predictive power functionals parametrized 
by one parameter 70% < r < 100%, called here the "floor predictive power". For each 
predictor Y of Y with current conditional probabilities of correct predictions p^,p^, the 
predictive power of Y was computed by 

/gx Qr(p\p°) = p\ if min(p\p°) > r%, 

^ ' Qr(p\p°) = 0, if min(p\p°) < r%. 

Note that Qr emphasizes strongly the probability of correctly predicting bad trading 
performances. The associated influence coefficients J{G) then quantify the current impact 
of the explanatory factors X'^ on "performance degradation". In our benchmark study of 
intraday datasets, systematic tests led us to fix r = 85%. 

6.2 Influence computation: accuracy analysis 

As in the preceding subsection, the time slice t is fixed and we keep the same notations. We 
now analyze how to implement a numerical computation of the current influence coefficients 
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J{G) for small groups of explanatory factors. Let m be the cardinal of G and denote 
X'^ = Z = [ Z^,...,Z"^ ]. To compute J{G) we need to compute an optimal decision 
function < ip{Z) < 1 , maximizing the predicting power of the predictor Y defined by 
F{Y = 1\Z} = ip{Z). By proposition^ , for each value z oi Z currently observed at time t, 
this requires first to estimate by empirical frequencies v{z) the probabilities 

v{z) =p[(z = 2)n(r = 1)] 

and then to find an optimal threshold < c < 1 for the v{z) values. At time t the estimates 
v{z) are derived only from the moderately sized sample of Kt = 700 of currently observed 
joint values for the pair {Z,Y). By construction of the binarized trading performance (see 
section [2]) the empirical frequency {Y = 1} is kept constant equal to 3% < q < 5%. So the 
number of currently observed values z of Z for which v{z) is non zero will always be inferior 
to Kf X 5/100 ~ 35. Empirical thresholding of the v{z) at time t can then obviously be 
restricted to exploring at most 35 values of c. 

We seek then an optimal decision function < ipi^) ^ which according to formula 
(|5|, should be associated to some threshold 1 > c > 0, with = 1 when v{Z) > c and 

ip{Z) = when v{Z) < c. To achieve statistical robustness and fast online computation, we 
restrict ip{Z) to only take the values or 1, and we thus impose ip^Z) = lv{z)>c- At time t, 
the predictive power g{c) = ir^ip) = Q{P^,P^) of the estimator Y = ipi^Z) depends only on 
its current probabilities of correct prediction 

= ¥{{v{z) >c)n{Y = 1)} p° = F{{v{z) < c) n (r = o)} 

At time t, these probabilities are approximated by current empirical frequencies P^,P^ after 
replacing v{Z) by the current estimates v{Z). This yields an estimated predictive power 
g{c) and the optimal threshold c* is selected from at most 35 threshold values by a trivial 
maximization of g{c). The current estimate of the influence coefficient of the group of factors 
G is then J{G) = g{c*). 

The practical accuracy of the estimates J{G) is strongly determined by the accuracy of 
the estimators v{z) of the probability v{z). Given the moderate size of Kt , we deliberately 
discretize all our explanatory factors so that the vector Z = X'^ is restricted to take only a 
finite number s of distinct values. Under the favourable assumption of approximate indepen- 
dence of the Kt trading orders observed at time t the errors of estimations e{z) = v{z) —v{z) 
have standard deviations v{z){l — v{z))/Kt and hence the relative errors e{z)/v{z) on v{z) 
are of the order of 1/ ^^/v{z)Kt, which is inferior to l/y/wKt where w = mmzv{z). 

The optimal threshold c* is computed above by empirically thresholding a set of at most 
35 estimated v{z), since 5% x Kt is of the order of at most 35 in our data set. The relative 
error of estimation on c* will then be roughly of the order of 1/y/wKt. Thus to obtain a 
relative error inferior to, say, 11% for the estimation of the optimal c*, one needs to at least 
impose the constraint < 1.21/100, wich yields 1/w < ^'Hq* ■ Since P(Z = z) > w 

for each z in the currently observed range of Z = X^ which after discretization contains 
s values, we have \ > sw and hence s < 1/w < ^^f^^- In our benchmark intraday data. 
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the number Kt of orders active at time t is Kt = 700 , and hence the cardinal of observable 
values for the discretized random vector Z = should be less than 8. 

Maximal discretization of each explanatory factor is reached when each factor is binarized. 
But even in this case, since the vector Z = is of dimension m = cardinal(G), the number s 
of distinct values for X'^ is s = 2"* and hence we must still impose the constraint cardinal(G) 
< 3. 

These cardinality constraints show that: 

• for cardinal(G) equal to 3, all three factors in X*^ must be binarized; 

• for cardinal(G') equal to 2, one of the two factors in X^ must be binarized and the 
other one must be discretized with at most three values; 

• for cardinal(G) equal to 1 , the single factor involved must be discretized with at most 
8 values. 

In our benchmark study of intraday data the number Kt of trading orders concretely ranged 
anywhere between 400 and 700. The preceding analysis thus showed that at each time t and 
for any group G of explanatory factors, the computation of the influence coefficient J{G) 
could only be statistically robust if cardinal((j') was equal to 1 or 2 and if each explanatory 
factor was binarized. In practice, at each time t, a key computing task is to select an optimal 
binarization of each explanatory factor, as indicated below. 

7 Influence computation: optimized discretization of 
explanatory factors 

7.1 Influence computation for a single explanatory factor 

Consider any real valued single explanatory factor Z having a continuous conditional density 
function w{z) given Y = 1. To predict Y given Z the best binarization of Z should select a 
subset B of S maximizing the predictive power of the predictor 1b{Z). An easy extension of 
the proposition [2] proved above, shows that any optimal B should be the set of all 2; G M such 
that w{z) > c for some c > 0. Thus an optimal B must be a closed level set L of the unknown 
conditional density function w{z). The family of all closed sets in M is well known to have infi- 
nite Vapnik-Cervonenkis dimension (see [Vapnik and Chervonenkis, 1971| , |Vapnik and Chervonenkis, 1971 
So, in view of Vapnik's theorems on automated learning (see [Vapnik, 2006| , [Cristianini and Shawe- Taylor, ' 
empirical optimal selection of B among all closed sets will have weak generalization capacity, 
increasing extremely slowly with the number Kt of data. This has naturally led us to select 
sub-optimal but much more robust classes of predictors, whith radically reduced Vapnik- 
Cervonenkis dimension. 

In the cases where w{z) can be considered as roughly unimodal or monotonous, the level 
sets of w are unions of at most two disjoint intervals. We thus deliberately restrict our class 
of binary predictors of Y to two-sided ones: 
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Definition 1 (Two-sided binary predictor). Two-sided binary predictors are of the form 

hg = 1b{Z) where B is the union of the two disjoint intervals {—oo,9~) and (^"'",+00) , 
indexed by the vector 9 = {9~, 9+) e M^, with 9~ < 9^ . 



6~ 



B 
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Figure 7: Typical predictor defined by two intervals (—00, 9 ) and {9^ , +00). 

Note that he predicts bad trading performances if and only if the explanatory factor Z 
takes sufficiently large or sufficiently small values. Hence these estimators of trading per- 
formance degradation have an immediate interpretability for natural users of online trading 
performance monitoring. 

At time t, given the current Kt joint observations of the explanatory factor Z and of the 
binarized trading performance Y, an immediate counting provides for each 9 the empirical 
estimates and P° of the probabilities of correct prediction P^,P^ for the estimator he, 
given by 

p^ = P{(z < 9-) n (r = 1)} + P{(z > 9+) n (F = i)} 
p° = p{(^- < z < ^+) n (r = 0)} 

The predictive power t:{9) of he is then readily estimated by the explicit formula 

7r(^) = g,(p\p°). 

The influence coefficient X(Z, Y) of Z on F at time t is then estimated by maximizing Tt{9) 
over all ^ in M^. At time t, the set S of currently observed values of Z has cardinal inferior 
or equal to Kt- The previous formulas show that to maximize 7r(6'), we may in fact restrict 
both 9~ and 9^ to belong to S, so one needs only explore at most K'^ /2 values of 9. 

Clearly this computation tends to underestimate the influence X{Z, Y) . Nevertheless 
in our benchmark studies we have systematically applied this approach for the following 
reasons. 
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• the set of binary predictors has the merit of having finite Vapnik-Cervonenkis 
dimension equal to 2 , so that our empirical estimate of the maximum of 7t{6) will be 
statistically robust even for moderate realistic values of Kt = 700; 

• the immediate interpretability of the predictors hg enables user friendly online graphic 
displays of the explanatory factors currently having high influence on performance 
degradation. 

• at each time t, at most Kf basic operations suffice to implement the brute force max- 
imization of Tc{6) which generates our current evaluation of T{Z, Y). 

Note that the two optimal thresholds (6^,6^) will of course strongly depend on the time 
slice t. 

When the explanatory factor Z is one of the 21 smoothed anomaly detectors [[At]] > 

introduced above in section [3} the preceding implementation can be simplified. Recall 
that [[At]] records the maximal gravity of very recent anomalies of fixed type affecting the 
dynamics of a fixed market variable, and that for "most" time slices, [[At]] takes the values 
0. Thus it is natural to expect that only higher values of [[At]] to be potential explanations 
for currently degraded trading performances. So for practical applications to Z = [[At]] of 
the preceding approach, we may actually impose the constraint 6~ = 0, with essentially no 
loss of predictive power. 

7.2 A few examples for single explanatory factors 

The empirical strategy just presented to estimate the infiuence J{G) when cardinal(G) equals 

1 has been numerically validated on our benchmark set of intraday data. We now outline 
a few examples. Recall that our benchmark study used the predictive power functional 

= Qr{P^, P^) given above by formula Q, which is specifically sensitive to predictors 
capacity to detect degradations of trading performances. On our intraday data sets, we have 
methodically tested the values r = 70%, 75%, 80%, 85%, 90%, 95% for the "fioor predictive 
power" r ; the value r = 85% turned out to be the best choice for these data sets, and was 
adopted for all results presented below. 

Figure |8] illustrates for the fixed time slice t = 45, the predictive power of the predictors 
he based on the single market variable Z = "Momentum in Bid- Ask Spread" . 

The trading performance evaluator PE is the "slippage in bid-ask spread". The PE- 
thresholds it = £45 determining low trading performance is fixed at the 3% -quantile of all 
performance evaluations observed at time t = 45. 

The X and y axes in the graph (figure [s]) indicate the threshold values {6~,6~^) for the 
market variable Z = "Momentum in Bid- Ask Spread". The z axis displays the predictive 
power n{6) of hg. The red marker indicates at time t = 45, the estimated infiuence coefficient 
X(Z, Y) on Y for this specific market variable Z, which turns out to be equal to 100%. The 
threshold vector 6 = 9^^ which achieves maximum predictive power at time 45 is equal to 
(66.76, 3.87). At each time t, our 7 basic market variables can then be ranked on the basis 



24 



of their approximate influence values computed as above, which provides a ranking of their 
respective capacity to explain current bad trading performances. 

Predictive Power 



95.2 - 



79.4 - 



64.8 - 



50 - 



41.7 - 



36 - 



25.4 - 



19.4 - 



9.5 - 



e.^ ^t.^ <^'p 0,0,1 tCl^ t,c,e> ^^1 ^(.^ 

Figure 8: Predictive power as a functional of the two thresholds 6^ (y-axis) and 6^ (x-axis) 
of the market variable Volume Score at time slice t = 12. It can be seen that d~ lower than 
10 and 6^ around 65 generate an efficient predictor of bad trading performance during this 
time slice. 

8 Influence computation for pairs of explanatory fac- 
tors 

Again at fixed time t, we now sketch our online "optimized fusion" of predictors to estimate 
the infiuence coefficient J{G) when G is a group of 2 explanatory factors Z = [ Z^,Z^ ]. 
Our statistical robustness analysis above indicates the necessity to consider only classes of 
trading performance predictors having radically low Vapnik-Cervonenkis dimension. So our 
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predictive power maximization among predictors based on Z is deliberately restricted to the 
following class of predictors. 

Let be the set of all 16 functions mapping {0; 1}^ into {0; 1}. The class 'K will be the 
set of all predictors of the form 

/i(Z)=m(/(Zi),^7(Z2)) 

where m G '^2-, and the indicator functions f{z) and g{z) are both two-sided binary predic- 
tors in the sense of definition [T] The class of binary predictors 1-i has Vapnik-Cervonenkis 
dimension equal to 4. Hence the estimation of maximal predictive power within 1-i by empir- 
ical estimation of probabilities and P° on the basis of the current Kt joint observations 
of y, Zi, Z2 will be statistically robust. This provides at time t a stable estimator of J{G), 
which as above tends to undervalue the true J{G). 

In concrete implementation of this approach at fixed time t, we first select only pairs 
of predictors /(Zi), (7(^2) which already have reasonably high probabilities of correctly pre- 
dicting Y. To maximize the predictive power of h{Z) = m(/(Zi), 51(^2)) we need to select 
the best binary polynomial m among the 16 elements of ^2- We then impose m(0,0) = 
and m(l, 1) = 1, so that whenever the predictions of f{Zi), and g{Z2) agree, we also have 
h{Z) = f{Zi) = g{Z2). This "accelerated fusion" is fairly classical in multi-experts fusion 
(see [Alkoot and Kittler, 1999] ) and obviously provides an acceleration multiplier of 4 in the 
online computation of J{G). 

For groups Gofk = 3ork = 4 explanatory factors, one could estimate J{G) by simi- 
lar sub-optimal but implementable strategies. However the corresponding predictor classes 
have Vapnik-Cervonenkis dimensions 6 and 8, and their statistical robustness is hence much 
weaker in the concrete context of our intraday datas set, since at each time t the key Vapnik 
ratios Kt/6 and Kt/8 were resp. inferior to 120 and 70, values which are much too small 
and strongly suggested to avoid the estimation of J{G) for cardinal(G) > 3. 

9 Numerical results 

We now present the numerical results obtained by applying the above methodology to our 
benchmark dataset of intra-day trading records. 

9.1 Dataset (portfolio) description 

Recall that our intra-day benchmark data involve a total of 79 time slices of 5 minutes 
each (i.e. this portfolio has been traded from 8:55 to 15:30 London time), and that we are 
monitoring a portfolio of 1037 trading orders, with a maximum of 700 trading orders active 
simultaneously at each time slice. 

At each fixed time slice t, we compute the current influence coefficient for each one of 
our 28 explanatory factors, namely the 7 market descriptors themselves and the 3x7 
smoothed anomaly detectors monitoring the dynamic of these market descriptors. 
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These 28 explanatory factors generate 378 = 28 x 27/2 pairs of factors. By accelerated 
fusion as above, we compute, at each time slice t, the influence of each one of these 378 pairs 
of explanatory factors. 

Among these 406 = (378 + 28) groups of explanatory factors, at each time t, we retain 
only those having both conditional probabilities of correct predictions {p^,p^) larger than r%. 
Here r% > 70% is the user selected "floor predictive power". Note that each single factor 
or pair of factors retained at time t can predict current degraded performance degradations 
with a false alarm rate FAR = 1 — p° inferior to (100 — r)%. 

Among the retained groups of explanatory factors, we compute the maximal influence 
max Jt achievable at time t. We also determine the set of dominating groups of explana- 
tory factors, defined as the groups of 1 or 2 factors having an influence equal to and 
achieving the minimal false alarm rate (100 — r)%. 

9.2 Predictive power of market descriptors 

Figure [9] gives an heatmap of the predictive power of a selected subset of market descriptors 
on the whole portfolio. The display shows that: 

• no market descriptor is used before slice 60 (i.e. 14:00), meaning that there are no 
significant predictive links between bad trading performances and specific values of the 
descriptors. 

• Then the Volume Score has the capability to explain bad trading performance from 
14:10 to 14:20 and from 14:50 to 15:00. It means that during these two time intervals, 
bad performances occurred simultaneously with quite unusual levels of traded volumes. 

• The Volatility Score emerges as a complementary explanatory factor between 14:55 and 
15:05; orders with bad trading performances focussed on stocks having unexpectedly 
high volatility levels during these 10 minutes. 

• The Bid-Ask Spread Score conforts this automated diagnosis: a rare event did indeed 
degrade trading performances around 15:00. Keeping in mind that scores are computed 
according to historical values during last weeks, it means that for this portfolio, the 
worst performances occurred on stocks for which volumes, volatility, and bid-ask spread 
had abnormal values. 

It is interesting to note that, before scoring, market performances do not explain that 
well bad performances. 

9.3 How alarm zones explain bad trading performance 

Since the two-sided binary predictors are built to explain the degraded performances of 
the worst trading orders, it is easy to identify the most impacted orders one given market 
descriptor. Here we consider the trading order T(139) whose lifecycle is shown on Figure 
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Figure 9: Predictive powers of some explanatory variables (horizontal scale is time in slices 
of 5 minutes). 



|4| in order to visualize the impact of its Volume Score, Volatility Score, and Bid-Ask Score 
on the order performance. First note that this order has been active from 10:55 to 15:30 
London time, with a start time two hours after the launch of the portfolio. It means that 5 
minutes slices on this order are numbered from to 49; they have to be shifted by 30 to be 
synchronized with the time scale of the other portfolio orders. 



Alarm zones on the Volume Score. Figure [TO] shows alarm zones for the Volume Score 
of order T(139): the top subplot draws the value of the volume score through time and the 
associated alarm zones have been added on top of it, when triggered. 

For the whole portfolio (Figure [9]), alarm zones are triggered on the Volume Score from 
14:10 to 14:20 and from 14:50 to 15:00 (i.e. slices 32 to 32 and 40 to 41 for this specific 
order). They are drawn like "gates" from the low threshold {6^) to the high one (Of); if 
the value of the Volume Score is outside bounds for the given order: we thus state that "i/ie 
Volume Score contributed to the degradation of trading performance for order T(139)". 
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Figure 10: Auto adaptive alarm zones on the Volume Score explanatory variable for the 
order T(139) displayed in Figure |4| Top: four alarm zones are active, two realizations of 
the Volume Score exceed the auto adaptive thresholds and thus emerge as a highly likely 
explanation for the bad performance exhibited by this order (see Bottom graph.). 



It is important to note that if if our thresholds had not been adaptive as proposed by 
our ''influence analysis'^ methodology, they would have generated false explanations at the 
start of the order (around slice 4, i.e. 11:15 London time). 

Specifically for this order, the Volume Score does not enter the first alarm zone (around 
14.15); it is in line with the performance of order T(139) that is normal (bottom subplot of 
Figure 10). The boundaries of the next alarm zone (around 15:00) are crossed by the order 
Volume Score; indeed this order performs quite poorly at that time. 



Alarm zones on the Bid-Ask Spread Score. Only one alarm zone has been activated 
on the portfolio (around 15:00) and the advantage of the auto adaptive approach proposed in 
this paper is straightforward: a unique threshold for all the duration of the portfolio would 
clearly not have been able to separate the 41st slice of order T(139) from the others. 



Alarm zones on the Volatility Score. Once again it is clear that the alarm zones 
succeeded in isolating slices to efficiently explain the bad trading performance around 15:00. 
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Figure 11: Auto adaptive alarm zones on the Bid-Ask spread Score explanatory variable 
for the order of Figure |4j Top: one alarm zone is active, since the Bid-Ask Score exceeds 
the auto adaptive thresholds, and thus emerges as a quite likely explanation of bad trading 
performance ( as validated on the bottom figure). 

To summarize. When applied applied to our real portfolio of 1037 orders traded during 6 
hours and 35 minutes, the automated influence analysis methodology presented and studied 
from a theoretical viewpoint in this paper efficiently selects quite pertinent explanatory 
factors for degraded trading performance: 

• our alarm zones use thresholds that are automatically adapted online to successive 
time slices, as computed via the predicting power of two-sided binary predictors (see 
Definition [T]) based on market descriptors. 

• Our approach generates generates auto adaptive thresholds taking into account cur- 
rently observed synchronicity between the user selected performance criterion (chosen 



according to the trading goal, see Section 2.1) with market descriptors market descrip- 
tors. 

At each time slice, the computed adaptive thresholds on market descriptors apply to 
the whole portfolio, bad trading performance of orders for which market descriptors 
take values outside alarm zones are said to be explained or influenced by the given 
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Figure 12: Auto adaptive alarm zones for the Volatility Score explanatory variable for the 
order of Figure |4j Top: two alarm zones are active, one realization of the Volatility Score 
exceeds the auto adaptive thresholds giving and thus provides a highly likely explanation of 
the current bad trading performance (as displayed on the bottom graph). 

descriptors. We commented real examples to illustrate these automated selection of 
explanatory factors. 

• The added value of augmenting the state space of market descriptors using scores has 
been illustrated on several examples within our benchmark data set o trading orders.. 

10 Conclusions 

The approach presented in this paper quantifies in real-time, the negative influence (on 
trading performances) of currently detected abnormal behavior of market factors. The pa- 
per presents a theoretical framework to be used for TCA (Transaction Cost Analysis) and 
uses it for online TCA. Dynamic influence-based ranking of market factors provides in real- 
time the most likely causality links between current bad trading performance and currently 
detected abnormal behavior of market variables. To accurately capture the effect of anoma- 
lies detected on the dynamics of market variables, binary performance predictors based on 
anomalies have been extended to cover lagged recent occurrences of anomalies. In particular, 
this captures and quantifies the predictive power of crenels, jumps over multiple time steps. 
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etc. Our algorithms provide real-time evaluations for the current influence of speciflc market 

variables on currently observed trading performance degradations, with a small time-delay 
dependent on market liquidity. Moreover, influence analysis can generate efficient real-time 
answers to online queries by multiple traders as well as post-trade analysis. Our methodology 
enables fast in-depth dynamic evaluation of trade scheduling algorithms, and should help 
to quantify the comparative analysis of various trading algorithms. This will enhance new 
automated approaches to optimize the parameters of trading algorithms by intensive testing 
on historical data. Note that we do not address here the issue of picking potential explana- 
tory market factors, possibly from a very large pool of available factors. We consider here 7 
basic market variables as given arbitrarily, or as pre-identified based on expert knowledge. 
However, new variables could be automatically included as potential explanatory factors by 
scanning a large set of market variables and ranking them after computation of their re- 
spective influence on performance degradation. For instance, it is advantageous to include 
factors which are uncorrelated and have historically shown to provide reliable explanation 
for performance degradation. 
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